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Abstract 

We analyze the problem of transmitting information to multiple users over a shared wireless channel. 
The problem of resource allocation (RA) for the users with the knowledge of their channel state 
information has been treated extensively in the literature where various approaches trading off the users' 
throughput and fairness were proposed. The emphasis was mostly on the time-sharing (TS) approach, 
where the resource allocated to the user is equivalent to its time share of the channel access. In this 
work, we propose to take advantage of the broadcast nature of the channel and we adopt superposition 
coding (SC) — known to outperform TS in multiple users broadcasting scenarios. In SC, users' messages 
are simultaneously transmitted by superposing their codewords with different power fractions under a 
total power constraint. The main challenge is to find a simple way to allocate these power fractions to 
all users taking into account the fairness/throughput tradeoff. We present an algorithm with this purpose 
and we apply it in the case of popular proportional fairness (PF). The obtained results using SC are 
illustrated with various numerical examples where, comparing to TS, a rate increase between 20% and 
300% is observed. 
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I. Introduction 

In this paper, we derive an RA scheme for downlink multi-user communications where various 
utility functions may be applied. The distinctive feature of the analyzed scheme is that it is based 
on SC. Unlike the popular and well studied TS approach, where at each time instant only one 
user is receiving data, with SC many users may receive their respective payload simultaneously. 

In downlink communications over time-varying channels, RA depends on the instantaneous 
channel condition between the base-station (BS) and the user (or mobile-stations (MS)). This 
usually results in transmission schemes which allocate resources (time, frequency, power) to the 
user which experiences the most favourable channel conditions. 

In presence of multiple users, it was shown in [1] that the optimal strategy to maximize 
the total throughput (sum-rate of all users) is to schedule the user with the best link during 
each transmission unit. This multiuser diversity (MD) [0 Ch. 6.6] maximizes the overall system 
throughput by allocating the shared resource to the user that can best exploit it. However, this 
approach raises a "fairness" issue since it would result in shared resources being monopolized 
by the users with the best channel conditions (e.g., with a direct link to the BS, or at a short 
range from it), while the user with poor channel conditions would rarely access the channel 
affecting considerably his throughput. 

Total throughput enhancement and fairness are hence crucial but conflicting criteria in the 
design of optimal RA schemes. 

To address this issue, many utility-based approaches — where utility represents a function of 
user's throughput — have been proposed in the literature to consider both fairness and throughput 
in the design of scheduling and RA algorithms. Among them, PF [3 J based on the logarithmic 
utility function is a well-known criterion introduced to balance between throughput and fairness. 
Other approaches adopt different variants of the utility function but most of them can be reduced 
to the maximization of the weighted sum of users' throughputs. 

Using PF (or any other utility-based criterion) in the case of TS leads to well-known and 
simple-to-implement results with the channel being allocated to a single user at any transmission 
time J3), flU. On the other hand, it is also known that TS approach is outperformed by SC 
[51 Ch. 15.1.3] when communicating over shared (broadcast) channels. In SC, the transmitter 
splits the available power among the multiple users, superimposes the resulting codewords, and 
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broadcasts them on the downlink channel. The underlying assumption is that the users are capable 
of decoding SC signals via successive decoding. This is not a very restrictive assumption as the 
so-called hierarchical modulation, closely related to SC, is nowadays included in communication 
standards, e.g., [|6). 

SC-based RA for cooperative communications was analyzed in 0, (HI but the formal analysis 
of multi-user SC was not addressed therein. It was also studied in 0, |[T0l . where optimal 
solutions were derived using the approach of [fTT|. [fl2l With respect to fl9]|— flTTJ our contributions 
are the following: 

• We derive the power-fraction allocation algorithm from the Karush-Kuhn-Tucker (KKT) 
conditions applied directly to the RA problem at hand which is similar in spirit to the 
approach used by lfT3l for the case of TS. The resulting, sorting-like algorithm is very 
simple and has the complexity linear in the number of users (hundreds of users are easily 
dealt with). Our approach does not require the utility-based formalism of [fTTT|. [fT2j: it is 
hence simpler to derive and reveals the underlying structure of relationships, which lead to 
the simple algorithm we propose. 

• In the numerical examples we show that the gains provided by SC combined with PF 
criterion can lead to a multi-fold throughput increase for certain classes of users without 
penalty to the others. Moreover, we show that, in a single-cell scenario, and with a growing 
number of users, the total throughput improves by up to 50% with respect to TS. 

• We observe that almost the entire power is distributed amongst just a few users. Motivated 
by this observation, we propose to apply SC only to a limited number of users and propose 
the respective algorithms in this case. 

• We show that the two-user SC is not only much more practical than the general multi-level 
superposition, but it also achieves most of the gains provided by unconstrained RA. 

The rest of the paper is organized as follows. In Sec. [Ill we introduce the adopted transmission 
model, and in Sec. [III] we discuss RA principles. We develop a simple algorithm to define the 
power allocation policy for SC in Sec. |IVJ where we also analyze the case of power allocation 
under constraints on the number of scheduled users. Conclusions are drawn in Sec. [V] 
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II. Transmission Model 



We consider the scenario where the BS has to send information to L distinct users. We consider 
the fiat block-fading channel model commonly used in the analysis of wireless systems. Namely, 
we assume that at each discrete time instant n, the signal received by the Zth user is modeled as 



where x[n] is the unitary-power signal emitted by the BS, zi[n] is the zero-mean unitary variance 
random process modeling noise/interference, and snrjn] is the signal-to-noise ratio (SNR) at the 
l-th receiver. 

In the block-fading model, for a given user I, the SNR is modelled as a white random 
process SNRj[n]. Thus, the SNR remains constant for the duration of the entire block but varies 
independently between blocks. 

While we do not need to assume any particular distribution to characterize the fading, we 
focus on the Rayleigh distribution in the numerical examples, that is, the probability density 
function (PDF) of SNR is given by 



where snr"/ is the average SNR of the Z-th link. 

The data of each user is assumed available at the BS at any time instant n (the so-called 
"saturation" scenario) and it is delay-insensitive, thus we can consider long-term averages as 
relevant performance measures. Moreover, we assume that at the beginning of each transmission 
block, each user informs the BS about the value of its instantaneous SNR snrj[n], through a 
perfect feedback channel. We do not consider the related transmission overhead here as this 
issue is out of the scope of the paper. 

These assumptions allow us to focus on the main problem addressed in this work; namely, 
multi-user resource allocation, and in particular - the one based on SC. 

The BS at time instant n forms the signal x[n] using the modulation/coding scheme (MCS) 
(/>(•) so that the rate conveyed to the user I is given by 




(1) 




(2) 



n[n] = (f)i(snr[n],p[n\), 



(3) 
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where p[n] gathers all the parameters defining the MCS and 

sm[n] = [snr 1 [n],...,snr z ,[n]]. (4) 
RA consists, therefore, in choosing the appropriate vector p[n]. 

The simple and popular multi-user MCS relies on time-sharing (TS), where each user is 
assigned a fraction of the available transmission time so that 

4>J S (snr , [ra],p[n]) = pi[n] log(l + snrj[ra]), (5) 

where, for simplicity, we assume that MCS uses a capacity-achieving coding. That is, we con- 
sider the case, where signals x; are obtained from infinite-length, randomly generated Gaussian 
codebook. These idealistic assumptions allow us to focus on the allocation strategies and provide 
upper limits on the rates achievable for any practical coding scheme. 

In the context of TS, RA consists most often in dedicating the entire transmission time to one 
particular user. Then, scheduling (i.e., determining which user should transmit) is equivalent to a 
RA. The simplest RA scheme is based on the so-called round-robin (RR) approach where each 
user is assigned periodically (with period L) to the entire transmission block, thus 

p[n] = S t KR [n] (6) 
t RR [n] = [n] L + 1 (7) 

where we use 5 t = [0, . . . , 0, 1, 0, . . . , 0] to denote the L-length vector with a non-zero element 
at position t, and f-J L denotes the modulo-L operation. 

Then, each user occupies the channel during exactly the same fraction 1/L of the overall 
transmission time and its throughput is given by 

#, = -|-Es NRi [log(l + SNR*)]. (8) 

We note that the same result in terms of throughput will be obtained assigning each user a 
portion pi — 1/L of the block (if we ignore the practical limitation related to distributing the 
finite time among L users). 
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III. Resource Allocation 

RA strategies may be defined via a function p = p(snr) designed to maximize the sum of 
the so-called utility functions defined over users' throughputs Ri 

L 

p(snr) = argmax ^ U(Ri), (9) 

P(snr) l=1 

where 

^ = E SNR ^(SNR,p(SNR))] (10) 

and 

SNR= [SNRi,...,SNR L ] (11) 

is the random vector modeling ©. 

For example, using U (R) = R corresponds to the maximization of the aggregate throughput 
R = Yld=i Ri an d it can be shown that, then, the optimal RA is defined via TS with only one user 
(having the maximum instantaneous rate (MR) or -equivalently, the maximum SNR) scheduled 
for transmission within the block [4], i.e., 

p[n] = d tM R [n] (12) 

t MR [n] = argmax snrj[n], (13) 

ie{i,...,L} 

However, RA in (fT2l) -(fT3l results in a situation where the high-SNR users receive the highest 
throughput Ri, while weak-SNR users obtain lower throughputs Ri. This is considered "unfair" 

a. 

To address this issue, various criteria have been proposed in the literature aiming to improve 
the fairness of RA algorithms. Among them, the PF criterion is arguably one of the most popular 
[fT3l and corresponds to ® based on the utility function 

U(R) = log(R). (14) 

On the other hand, the max-min optimization 

p = argmax min {RA (15) 
P ie{i,...,L} 

where resources are allocated so that the weakest user is prioritized, tend to yield equal-rate 
(ER) RA. 
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A. On-line adaptation 

Using U(R) = R, the function p(snr) is defined in closed-form via (fl"2l) and (fl"3T ) but this is 
rarely the case. In fact, it is rather difficult to find the optimal mapping p(snr) for the popular 
utility function, such as the one in (fl4l) corresponding to PF. The main difficulty is to calculate 
the expectation (flOl) in closed form. 

To overcome this problem, we may use estimates of the throughput based on temporal averages 

ma 

. w-i 

= wE r An-t] 
t=o 

= gWz^zia + *[„ _ !]. (16) 

Using (fT6l) in ©, finding the optimal allocation parameters for the PF utility function (fl4)) 
can be formulated as the following optimization problem lTT3Tl 



m = argmax £ V ( S,\n - 1] 4- - n[n - W] \ ^ 

Pin] ^ V W J 

Further, for long observation windows, — >■ oo, i.e., when 

we may use the first-order approximation U(R + r) w f/(-R) + U'(R) ■ r, which yields 

L 

p[n] ~ argmax t/' [n — l])rj[n] (19) 

p[n] l=1 
L 

= argmax /?; [n]rj [n] (20) 
p[n] {=1 

where the terms independent of p and the common multiplication factor W ( not affecting the 
optimization results) were removed. 

The form of (l20T ). emphasizes that the utility-function based approach may be reduced to the 
optimization of the sum of instantaneous rates ri[n] weighted by f3i{n] = U'(Ri[n — 1]) IfTOl . 

We emphasize that the adaptation rule (l20l is valid irrespectively of the adopted utility function 
or MCS, that is, it may be applied for various forms of 0;(snr, p) or U(R). In particular, for 
U(R) = R we recover the max-SNR (i.e., also max-instantaneous rate ri[n\) solution we have 
shown in (fT2b-(fT3T>. 
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Example 1 (Resource allocation in TS): Considering TS again, we have to use MCS with 
rates defined by ©, thus (l20t becomes 

L 

p PF ~ TS [n] = argmax^ PiPi[n] log 2 (l + snrj[n]) (21) 
p i=i 



L 

s.t. ^p, = l,p,>0. (22) 

i=i 

It is easy to see that (l2"TT) is solved by scheduling only one user [fT3l indexed by 



t[n] = argmax/^fn] log 2 (l + snrj[n]). (23) 

l€{l,...,L} 

Then, if we opt for using the PF utility function ([14)) . we obtain U'(R) = R~ x , thus [3i{n] = 
l/Ri[n — 1] and (|23T) becomes 

i PF - TS [n] = argmax— ^4 (24) 

ie{i,...,L} — 1] 

Thus, the optimal solution is given by 

p PF - TS [n] = 6** w . (25) 



This is the well-known proportionally fair TS (PF-TS) resource allocation [1141 . The choice of 
the scheduled user depends on the ratio (proportion) between the instantaneous achievable rate 
log(l +snr/[n]) and the throughput Ri[n— 1] w R\. Thanks to the normalization by R[, the users 
with relatively small average SNR (and thus also relative small value of Rf) are granted access 
to the channel more frequently than in the non-proportional max-SNR scheduling (fT2l) -(fT3l. 

We note that we do not calculate explicitly the expectation (flOl) . Instead, by applying (1231) and 
(|25T) . the PvA algorithm "learns" through the local optimization (|2TT) what the globally optimal 
solution is. 

An important common feature of all mentioned RA schemes based on TS is that, in the nth 
block, only one user is scheduled for transmission, that is, (1231) is valid independently of the 
chosen utility-function. 

IV. Optimal RA with superposition coding 

We will now take the analysis of RA based on utility-function to a more involved multi-user 
MCS well suited for the wireless downlink transmission. While we use the PF utility in the 
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examples, i.e., /3/[n] = l/Ri[n — 1], the presented solutions will be general, and remain valid 
when the utility function changes. 

To motivate the adoption of SC, and before defining the RA framework, we outline the 
principle of encoding/decoding based on SC. 

A. SC Broadcasting Principles 

From an information-theoretic point of view, sending information to the users over a shared 
channel (i.e., where the users receive the same broadcasted signal) is done optimally via SC J51 
Ch. 15.1.3]. 

In the case of L = 2 users, the solution that maximizes the sum of weighted rates is obtained 
by transmitting a superposition of the codewords, that is 

x[n] = ^/p[xi[n} + v ^i 2 ["] (26) 

where Xi[n] and x 2 [n] are the unitary power signals of each user, pi and p 2 are their power 
fractions, and we impose the constraint pi + p 2 = 1 so the emitted signal x[n] has a unitary 
power. 

We assume without loss of generality that sni^ < snr 2 . The decoding can be performaned as 
follows: the weak-SNR user decodes only its own message Xi[n] (treating the signal x 2 [n] as 
interference). Since it receives the signal 

yi[n] = y/snripix^n] + ^s<Mip 2 x 2 [n] + zi[n], (27) 

its achievable rate is given by 

0f(snr,p)=log 2 fl+ PlS " r O , (28) 

where the denominator of the fraction under the logarithm amalgams the power of the noise 
Zi[n] as well as the interference created by the signal y/p^x^n], which is possible because both 
are independent Gaussian variables. 

User / = 2 (with snr 2 > snri) can also decode message xi[n] and remove it from the received 
signal 

y 2 [n] = y/sm^x[n) + z 2 [n], (29) 
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the decoding of his own message x 2 [n] relies then on the interference-free signal 

y' 2 [n} = 2/2M - y/sr\r 2 [n]p 1 x 1 [n] = \/sm 2 [n}p 2 x 2 [n} + z 2 [n], (30) 
thus, the resulting rate is 

0^ c (snr,p) = log 2 (1 +p 2 snr 2 ) . (31) 

Since user / = 2 discards the message contained in Xi[n], decoding xi[n] does not contribute 
to his throughput. 

Parameters p = [pi,p 2 ] determine the power allocated to users and the level of interference 
user I = 1 experiences due to the signal x 2 [n] of user I = 2. We emphasize here that the powers 
are allocated to both users using solely instantaneous values of SNRs. That is, we do not attempt 
to take advantage of the channel dynamics, done by the so-called water-filling algorithms. 

To get an insight into the potential gains, Fig. Q] compares the rates achievable with SC and 
TS. 




log(l+snr 2 ) log(l + snr!,) 

</> 2 (snr,p) 



Fig. 1. Rates <^i(snr,p) vs. (j>2(snr,p) for p = [1 — P2,f>2] achievable with SC and TS and snri < snro < snr 2 . All rates on 
the corresponding curves are achievable varying P2 G (0, 1). 

All pairs of transmission rates (</> 1 (snr,p), 2 (snr,p)) on the curves corresponding to TS 
and SC, can be obtained varying p 2 E (0, 1). Of course, the interpretation of the parameter p 2 
depend on the MCS: for TS p 2 has a meaning of a time fraction, while for SC it represents a 
fraction of the transmit power. Clearly, for any given rate 0i(snr,p), using SC, the rate of the 
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remaining user 2 (snr,p) can be always greater when comparing to the rate obtained via TS. 
Exception are p 2 = (only user / = 1 transmits) and p 2 — 1 (only user / = 2 transmits), when 
TS and SC are equivalent. 

Moreover, we note that the advantage of using SC instead of TS becomes important when the 
difference between the SNRs of both users increases (note the difference between the SC and 
TS curves for snr 2 > snr 2 ); in fact, for snri = snr 2 , TS and SC are equivalent. 

SC transmission can be generalized to the case of L > 2 as follows: the transmitted signal is 
given by 

L 

x[n\ 

i=i 

where Ya=\Pi = 1 - 

Assuming snri < snr 2 < . . . < srir^, the decoding by the user / is done similarly to the case of 
L = 2: the signals of weak-SNR users are decoded and subtracted in a successive-interference- 
cancellation approach.! while the signals of strong-SNR users are treated as interference. Then, 
the rate of a reliable transmission to user I is given by 

0f c (snr,p)=log 2 fl + ^^-), (33) 

where 

L 

j=i+i 

denotes the total power of users l+l, 1+2, . . . , L. For convenience of notation, in what remains, 
we use 

i-i 

$> fc = 0; (35) 



k=l 



here, it means that p L = 0. 



B. Resource allocation for L = 2 

We now consider the case of RA for L = 2, which is relatively simple to derive and reveals 
the more general relationships that will be used for arbitrary L. 

'When L = 2, this is done only for I = 2. 
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If sni"! < snr 2 , using (1281 ) and ( f3TT ) in ([201) we have to solve the following maximization 
problem 

(l+p 2 snr 2 )^ 

p 2 = argmaxlog — (36) 

1*6(0,1) (l+p 2 snr 1 )^ 

where, the constraint pi + p 2 = 1 is taken into account and — to alleviate the notation — we omit 

time indices, i.e., snr fc = snr fc [n] and (3k = (3k[n\. 

Similarly, for snr 2 < snri, we need to solve 

(1+ Pl snri)ft 

pi = argmaxlog-^— (37) 



Pie(o,i) U+pisnr 2 



After a simple algebra, the solution of (1361 ) is given by: 



' a . a Asnrx /3 2 snr 2 

1, if /3i</?2V— <— 

1 + sni"! 1 + snr 2 

P2 = < 0, if /3isnri>/3 2 snr 2 (38) 

/3 2 snr 2 - ftsnr! 

— — , otherwise. 

I snrisnr 2 (^i - /3 2 ) 

Fixing (3 = [/?i,/3 2 ], the solution p depends solely on the values of the SNRs. In Fig. [2l we 
illustrate how snr affects the choice of p2 and compare SC with TS. Depending on the values 
of snr and (3 we may obtain the solution equivalent to TS (where we transmit to only one user) 
or to SC where we transmit to both users simultaneously. 

C. Arbitrary number of users 



Our objective function in (1201) is now defined as 

L 

y L (p) = ^A0f c (snr,p). (39) 

i=i 

where we use L = {1, 2 . . . , L} -the set integers from 1 to L, to emphasize that all the users 
are considered as the candidates for RA; later, in Sec. IIV-D1 we will consider the optimization 
under restrictions on the users that may be scheduled. 

Then we have to solve the following optimization problem 

L 

p = argmaxyi^p), s.t. y^Pi = 1, Pi > 0. (40) 

p i=i 
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-A_ 

pi- fa snri 



Fig. 2. Resource allocation may be seen as a mapping snr — » p. Here, L — 2 and /3i > /?2 and RA is defined via ( 1381 ), 
The light-shaded regions correspond to the solution of the problem d36t (solved under assumption snr2 > snri) where only 
one user is scheduled for transmission (pi — 1 or p2 — 1), The unshaded region corresponds to the case where we use SC, 
i.e., pi,p2 £ (0, 1). In the dark-shaded region we have snr2 < snri so the solution is found solving the problem d37t : through 
symmetry to the first case of ((38}, if snr2 < snri and /3i > /?2 we set f>\ = 1. The thick dashed (red) line separates the decision 
regions of TS: above the line we schedule the user I = 2 (thus f>2 — 1), while below the line we schedule user I = 1 (pi = 1). 



Applying the KKT conditions, as done also in Il4ll . we know that there exists a Lagrange 
multiplier A (associated with the constraint J2t=iPi = 1) anc ^ multipliers /ii, I — 1, . . . , L (each, 
associated with the constraint pi > 0) such that the optimal solution of (l40l) satisfies 

dyiip) 



dp 



-\ + Hi = 0, (41) 



where for brevity, we use ^eXp) A \p=p- In (|4T1) fii > 0; if \i\ > we say that the 



dpi dpi 

positivity constraint p\ > is active, and then pi = 0. If jii = the constraint is inactive. 
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Our problem will be then solved in two interconnected steps: 

1) First, we find indices I of the users which are not scheduled for transmission, that is, for 
which the positivity constraints are active (where we can thus set pi = 0). 

2) Next, we show that the remaining users have inactive positivity constraints and we explain 
how to calculate their optimal power fractions pV 

Using (|4TT) we can conclude with respect to the parameters p; for which the positivity con- 
straints are active. Namely, for any j, k e L we have 

^0>>%»U A = o. (42) 

OPj opk 

Assuming without any loss of generality that j > k, and after simple algebra, the left-hand 
side (l.h.s.) of (l42l) can be expressed as follows: 

dy^P) . dytXp) snrjPj ^ 1+^snr,- 
<?Pj dpfc snr fc /^ ^ i l+p fc snr fc 

where X]f=fc a i — ® takes care of the case j = k + 1, and V/ are arbitrary real numbers. 

We want to establish conditions under which the inequality (|43T) is satisfied irrespectively of 
p k , which will allow us to identify the elements of p to be made equal to zero, i.e., pk = or 

Pi = 0. 

Proposition 1: If j > k and J2i=l+iPi = 0' men me following relationships hold: 

Pk < Pi v T k < T j Pk = 0, (44) 

P k > Pj A z/ fe > ^ p,- = 0, (45) 



where 



I/* = snr fc /3 fc , (46) 
1 + snr fc 



A v k 

Tfc = — (47) 



Proof: cf. Appendix. 

Definition 1: Denote by p^*. the solution of 

1 + psnr,- Uj 
l+psnr fc v k 

with respect to p. 



(48) 
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Proposition 2: If k < j < m, Yli=k+iPi = ®> an( ^ YlT=j+iPi = 0' tnen me following holds: 

< Pj ,k < Pm,j < 1 => Pj = 0. (49) 

Proof: cf. Appendix. ■ 
Proposition Q] and Proposition |2] allow us to "purge" users whose power fractions are zero 

pi = 0. This can be done via simple element-by-element comparison between the parameters 

Pki v ki r fc> an d Pj,k using the algorithms we define below. 

To simplify the description of the algorithm, it is convenient to define a set £ = {£±, £ 2 , . . . , £k} 

as the ordered set that gathers indices to K "non purged" users, i.e., for which we did not 

determine if p£ k — 0, k — 1, . . . , K. Then, purging user / G C is equivalent to the elimination of 

his index / from the set C, which we denote as £ <r- £\l. 

We start with Algorithm \T\ which eliminates users according to (|44)) . After this first purge, we 

use Algorithm |2] which enforces (l4~5T ). Finally, we need to purge users using Proposition |2] and, 

to this end, we proceed using Algorithm |3] 

Algorithm 1 Purging users according to (l44l 
Input: Pi,ti 

Output: Removes indices I from the set C according to ( l44l >. 

1; IL 

2: j <- L 

3: fc <- J - 1 

4: while k > 1 do 

5: if ft > /3j A r fe > Tj- then 

6: j <- 

7: else 

8: £ <- 

9: end if 
10: k <r- k - 1 
11: end while 



It is immediate to see that each of the above algorithms is executed using at most L element- 
by-element comparisons. The total complexity is then linear in L. 

After executing Algorithm |3] K = \C\ users remain unpurged. We can now determine the 
optimal power-fractions. 
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Algorithm 2 Purging users according to (145b 
Input: V\ , C 

Output: Removes indices I from the set C according to ( |45| >. 

1: K <- \C\ 

2: k <- 1 

3: j «- ft + 1 

4: while j < K do 

5: if v ik < v tj then 

6: ft <- j 

7: else 

8: £ <- 

9: end if 
10: j - J ■ 1 
11: end while 



If K — 1, i.e., there is only one user with non-zero power fraction, i.e., pg 1 — 1. 

Proposition 3: After applying Algorithm [T] Algorithm |2] and Algorithm |3] the positivity 
constraints of all users remaining in the set £ are inactive, i.e., their Lagrange multipliers are 
fi ik = 0,k = l,...,K. 

Proof: cf. Appendix. ■ 
Then, to find the power-fractions we can use the following. 

Proposition 4: If the number of users which are not purged via Algorithm [T] Algorithm HI 
and Algorithm |3] is greater than one (K > 1), the optimal solution of the problem in (l40t is 
found using the following rule: 

Vi K = P£kM-i ( 5 °) 
Vh =PeiA-i -Ph+av l = 2,...,K-l, (51) 
p tl = l- P£ 2>il (52) 

Proof: From Proposition |3l we know that p£ k = 0. Then, the optimality conditions in (14TI) . 
combined with (|48T) . yield pi j} t h = p tk - This immediately yields the relationships in (l50l) . (ISTI) . 
and (|52l). ■ 

Example 2 (Optimal solution for L = 7): Suppose we have L = 7 users with the following 
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Algorithm 3 Purging users according to (1491 if K > 2 
Input: pk,j,C 

Output: Removes indices I from the set C according to d49i l. 

1: K <- \C\ 

2: k <r- 1 

3: j «- k + 1 

4: m <- fc + 2 

5: while m < if do 

6: if P£ m ,£j < P£j.i k then 

7: k j 

8: else 

9: £ <- 

10: end if 
11: j i— m 

12: m •<— m + 1 
13: end while 



numerical values 

snr = [1.7, 3.3, 4.4, 6.7, 7.7, 8.3, 8.6] (53) 

/3 = [6.0, 29.7, 26.5, 15.4, 4.6, 17.6, 12.2] (54) 

We are thus able to calculate 

v = [10.2, 98.0, 116.6, 103.2, 35.4, 146.1, 104.9] (55) 

t = [ 3.8, 22.8, 21.6, 13.4, 4.1, 15.7, 10.9]. (56) 

Running Algorithm [D we obtain 

snr=[x, 3.3, 4.4, x, x, 8.3, 8.6] (57) 

v = [x, 98.0, 116.6, x, x, 146.1, 104.9], (58) 

where we use "x" to denote the irrelevant values corresponding to the purged users. 
Using v from (l58l in Algorithm |2] we obtain 

snr=[x, 3.3, 4.4, x, x, 8.3, x] (59) 

(3 = [x, 29.7, 26.5, x, x, 17.6, x]. (60) 
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The non-purged users are now indicated by the set C = {2, 3, 6}, so we use ( 1591 ) and (1601 in 
(1481) to calculate 

p 6i3 = 0.09, p 3 ,2 = 0.40, (61) 
and, after applying Proposition 0] we obtain the optimal solution 

p 6 = 0.09, p 3 = 0.31, p 2 = 0.60. (62) 

Example 3 (Two groups of users and proportional fairness): We assume now that there are 
two groups of users, labeled "A" and "B". Each is composed, respectively, of L A and L B users 
having the same respective average SNRs, snr A and snr B . In Fig. [3l we show the throughput per 
user in each group: R A and R B , for RA strategies based on RR, PF-TS, and proportionally fair 
SC (PF-SC). 

We make the following observations: 

1) The advantage of PF-SC over PF-TS is well pronounced when sfTf A and sfTf B differ 
significantly as then, SC is most likely to provide notable gains. This is a reminiscence of 
the broadcasting results for a fixed SNR shown in Sec. IIV-AI 

2) Increasing the SNR of one group with respect to the other, the most significant throughput 
increase is obtained by the users in the least populated group irrespectively of their SNR: 
their throughput grows by up to 100% with respect to PF-TS. For example, in Fig. |3k, 
we observe that increasing the SNR of group "B", the throughput of users in group "A" 
improves by 100%. This can be interpreted as follows: SC tends to choose users with 
different SNRs as then the improvements over TS are notable. Consequently, in the two- 
groups scenario, most likely one user from group "A" and one user from group "B" will 
be chosen. Thus, users in the least populated group are scheduled for transmission more 
frequently. 

3) All users are drawing benefits from PF-SC while this is not always the case for PF-TS. In 
fact, the improvement in the throughput of group "B" is obtained by PF-TS at the expense 
of the throughput of group "A" which decreases for large values of snr B . 

Since in PF-SC various users are simultaneously scheduled for transmission^ using SC, it 

2 We reuse the term "scheduling" to indicate that the power-fraction is not set to zero. 
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Fig. 3. The throughput obtained using RR, PF-TS, and PF-SC resource allocation policies for a) La = 4, Lb — 16 and 
b) L A = 16, L B = 4; snr A = OdB. 



would be interesting to define how many can be simultaneously scheduled to allow the gains in 
Fig. [3] to materialize. 

To this end, we denote by K A and K B the number of users scheduled for transmission in 
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groups "A" and "B", respectively. In Fig. |4l we show the empirical probability of the events 
corresponding to different pairs (K A , K B ) that are the most likely to occur and we observe that 

1) In most cases, the number of users scheduled for simultaneous transmission is relatively 
small: Pi{K A + K B < 3} > 0.7. It is an important observation as SC with a small number 
of users might be realized via practical MCS such as the standard-defined hierarchical 
modulation J6]|. 

2) The event K A +K B = 1 means that only one user is scheduled, which is likely to happen for 
snr B ~ snr A , i.e., where SC and TS are equivalent. The probability of using SC increases 
when sn7 B increases, i.e., when the difference between SNRs becomes significant. 

3) The most likely to be scheduled are users taken from group "B" (K B = 2 or K B = 3) but 
even then, one of the users from group "A" is also scheduled. This explains the gains of 
PF-SC: while we privilege high-SNR users from group "B"; we still feed data to low-SNR 
users from group "A" using SC. 

We not that the number of scheduled users does not convey the whole information about the 
RA outcome as it does not reflect the values of the power-fractions pi which, indeed, can be 
very small. In particular, let us define 

P 2 = max{pj +p k }- (63) 

j,keC 

as the maximum power attributed to two users. We show in Fig. |5] the empirical probability 
Pr{P 2 £ V}, where V is the interval of power values. We can observe that, even if the probability 
of having more that two users scheduled for transmission in the scenario L A = 4, L B = 16 is 
relatively large (Fig. Hk), the power assigned to additional users (beyond the first two users) 
is small. In fact, in 90% of the analyzed cases, the first two users obtain more than 80% of 
the power. We do not show the case L A = 16, L B = 4 for which Pr{P 2 £ (0.9, 1]} > 0.95, 
i.e., almost all the available power is assigned to the first two users. 

D. RA under constraints on the number of scheduled users 

The numerical results in Example |3] indicate that, with the optimal RA, not only the number of 
users scheduled for transmission is small; but also the power of the first two users is dominant. 
This is interesting as, in practice, the number of superposed signals should not be very large. 
Thus, motivated by these results, we would like to obtain the RA algorithm where we limit the 
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number of users scheduled for transmission to a small value K max , and next, we will evaluate 
the penalty introduced by this additional constraint. 
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snr B 



Fig. 5. The height of the shaded area corresponds to the probability that the power of the two strongest users P2 belongs to 
the inteval V\ La = 4 Lb = 16, and stTFa = 0. 



Our objective thus is to find the optimum indices C 



where 



and 



C = argmax y c (p c ), 

£g]L«inax 



p c = argmax y c (p) 
v 

\c\ 

k=l 



k=l 

pi = if / ^ C 



(64) 



(65) 



(66) 
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with L being a J^-fold Cartesian product of L. 

This problem is more difficult than the optimization without constraint on the maximum 
number of allowed users K max . While the solution of (|66T ) has a linear complexity with |£|, we 
have to repeat it for all the elements of the set L Km ' dX ; the overall complexity is then proportional 
to L Km ™. 

To avoid this polynomial complexity, we propose the greedy optimization algorithm described 
in Algorithm |4] starting with the empty set C = we add one user at a time to maximize the 
overall objective function. While suboptimal, this algorithm provides a better solution than the 
TS-based RA. This is because the first user which is added to the set C is the one we find in 
the TS approach, cf. (123T ). Other users are added to the set C solely if their presence improves 
the cost function. If this is not possible, and the objective function does not increase (i.e., the 
power- fraction attributed to the optimal user found in step [4] is zero p~ t = 0) the algorithm stops. 

Algorithm 4 Greedy maximization of the objective function: indices of active users are added 

to the set C one-by-one. 
Input: K m . dx 

Output: Suboptimal solution of the problem in (l64l >. 

1: C <- 

2: K <r- 

3: while K < A max do 

4: I <- argmax ie]L Mc y {CM (P {c ,i} ) 

5: if V{c,t}{P{C,t}) > y^iPc) then 
6: C 4- {£, 1} 

7: 

8: else 

9: Stop 
10: end if 

11: end while 



Example 4 (Downlink transmission to users in a cell): Let us compare now PF-SC, PF-TS, 
and RR resource allocation strategies in a scenario which will highlight the most important 
properties of the proposed RA beyond the simplified case of two groups of users we considered 
in Example [3l 
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Consider the case when L users are distributed over a circular cell with a normalized radius 
dmax = 1- We fix the SNR at the edge of the cell to srTr(<i max ) = OdB and the average SNR at 
distance d is given by sfTf(d) = dr v , where the path loss exponent is set to v = 3 |[T0ll . [fT4|] . To 
avoid singularity (infinite SNR) at d = 0, we set snr(d) = srTr(cZ min ) if d < d min where d m { n = 0.1, 
and the maximum average SNR is thus snrfrfmin) = 30dB. 

We assume that the users are uniformly distributed over the cell and since only their distance 
d to the BS is important, we generate the latter as d = y/x, where x is uniformly distributed in 
(0, 1). The positions of the users are randomly generated A^ rep = 1000 times. Next, for all users 
whose distance falls into the interval [d — A, d + A], we calculate the throughput averaged over 
TVrep realizations of users' positions. We denote it by R(d) and show in Fig. |6] for PF-TS, PF-SC, 
and RR resource allocation strategies with L = 50. 

These results are in line with the conclusions obtained from Example [3j the least populated 
groups of users (i.e., those close to BS) experience the greatest improvement in their throughput. 
For the case we analyze, when d < 0.35 the increase is greater than 100% and in the vicinity 
of the BS we obtain a 300% throughput gain. 

At the same time, the throughput of all users is improved irrespectively of their distance d. 
This results in an increase of the aggregate throughput of the cell that we show in Fig. [7] as a 
function of the number of users L. We can appreciate that with respect to PF-TS, the aggregate 
throughput of PF-SC increases by 50% when L > 100. 

As we have seen in Example [3l SC tends to schedule more users with strong SNR, while 
keeping at least one weak-SNR user served. This explains the results of two-users SC (denoted 
as SC2): the penalty due to the constraint on the number of users K max = 2 is more notable for 
strong-SNR users and is less important for users that are far from the BS. Quite interestingly, 
there are no important differences between the throughput obtained via heuristic two-users RA 
described in Algorithm |4] and the optimal complex enumeration (|64|) . 

V. Conclusions 

We analyzed the problem of transmitting information to multiple users over a shared down- 
link wireless channel using SC. We solved the problem of allocating the power to the users 
maximizing the criterion of sum of utility function and we have shown examples based of the 
criterion of proportional fairness. The proposed resource allocation algorithm easily deals with 
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Fig. 6. The throughput as a function of the normalized distance d of the user from the BS; L = 50, the average SNR at the 
cell's edge is given by stTr(l) = OdB; SC2 refers to SC under constraint K max = 2, "Opt." to the optimal exhaustive search 
J 64b . and "Greedy" to the results obtained via Algorithm [4] 



a very large number of users and we illustrated its operation with numerical examples showing 
a rate increase from 20% and up to 300%. 

Appendix 

Proof of Proposition [7J- It is convenient to rewrite (1431) as 

> fj,kG>k), (67) 



snr fc /3 fc 
where the function 



psnr k 

is monotonically growing for p e (— 1/snrfc, oo). 



/i,*(p) = ' ( 6g ) 

1 + nsnr t 
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Fig. 7. The aggregate cell throughput as a function of the number of users L; the average SNR at the cell's edge is snr = OdB. 



Therefore, to prove (1441 ) we have two cases to consider 

1) For (3 k < (3j it is immediate to see that 

snr,-/3 7 - snr 7 

sm k /3 k snr fc p^oo p6 [ ,i] 
so (1671) is satisfied for any p k G [0, 1] and thus p k = 0. 

2) For /3 k > f3j, to satisfy (1671) irrespectively of p fc , we need the following 



snr,- 



(69) 



(70) 



snr fc /3 fc pe[o,i] 1 + snr fc 

which is equivalent to r k < tj. 

To prove (|45l ), we note that if we satisfy 

snr fc p fc pe[o,i] 

then —^-<fj,k{Pk) ls satisfied irrespectively of p k , and ( 1711 ) is equivalent to Vj < v k . 
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This terminates the proof. ■ 
Proof of Proposition [2]" 

We establish first a simple relationship, namely, from Definition [TJ we obtain S ™ T J = fj,k(Pj,k)- 
Then, (l67l) is equivalent to fj,k{Pj,k) > fj,k(Pk)- Because of the monotonicity of fj,k(p), the latter 
is also equivalent to the following conditions 

Pj,k >% =^ Pk = (72) 

Pj,k <V k => Pi = °» ( 73 ) 

where we know that p fc e [0,1]. 

To prove Proposition [2] we proceed by contradiction: suppose that < pj^ < p m ,j < 1» 
= and Y^j+iPi = 0- But, we suppose that pj > 0, and from d73l we obtain 
Pj,fc > Pfc> and from (1721 we get p mj - < Thus, 



> Pm,j > Pj,fc > Pfc = Pj + Pj, (74) 

where the last equality follows from (|34l and Yli'iZt+iPi = 0- To satisfy (1741) , we must set pj = 0; 
which contradicts the assumption pj > 0. 

This terminates the proof. ■ 
Proof of Proposition \3\~ 

We proceed by contradiction. Suppose there is a non-empty set J = {ji, . . . , jk 1 }, which 
contains subsequent indices to the non-purged users with active positivity constraints, i.e., £j l 6 
£, / = 1, . . . , K', and fi^ > 0,1 = 1, . . . , if', and for any k £ J we must have £ fc < ^ or 

There are three possible cases then 

1) ji > 1, jx' = and there is k — ji — 1 such that ^ = 0. 

2) jx = 1, < fT, and there is m = jk> + 1 such that fi im = 0. 

3) ji > 1, jk< < K', and there are k = ji — 1 and m = j K i + 1 such that ne k = and 
H m = 0. 



May 6, 2013 



DRAFT 



28 



In case [Oh we know that 

dyiip) > dy L (p) 
dp k dp h 

s" r jA < 1 +^i snr i 1 (76) 
snr fc /3 fc l+p^snr* 

— ^ < 1, (77) 

where the transition from (1761 ) to (f77l is based on the fact that jV' = K. Thus, , = p^ = 0. 
Since (1771 ) is equivalent to Uj 1 < u k , this means that j\ cannot be in the set £ as it would be 
purged via Algorithm |2] This is a contradiction, so case [Q cannot occur. 
In case [2]), we know that 

dyi(P) > dyL(p) g 
dp m dp jK , 



snr m /3 m ^ l+p jK ,snr m 



snr m /3 m 1 + snr m 

> T~ ) VoUJ 



snx 3 K ,Pi Kl l + snr jx , 

where the transition from (T79T ) to <180T > is based on the fact that j\ = 1. Thus, = p^ = 1. 
Since (l80l) is equivalent to , < r m , this means that j^-/ cannot be in the set £ as it would be 
purged via Algorithm [T| This is a contradiction so case |2) cannot occur. 
In case [3]), we know that 

> ~%T (81) 

~a > ~a ' (° z ) 

OPm Op jKl 



therefore, 



snr m /3 m l+p jjf ,snr m 
snr . J. > 1+ % snr . > P;* (83) 

where (1831) is obtained from (|72T ). and (I84T ) is obtained from (|73|) . Since p^ = Py combining 
([83]) and © yields 

PWm-i >P4+i,4- (85) 
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Since the following relationship must hold after running Algorithm [3] 

PhA > Plz,h > • • • > Pt K ,tK-ii 

(86) 

(f8~5l) is in contradiction with (l8~6l) . which means that case HJ cannot occur. 

Since none of possible cases can occur, we arrive at a contradiction with the assumption of 
having active constraints among non-purged users; this terminates the proof. ■ 
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